A Model Ground State of Polyampholytes 
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The ground state of randomly charged polyampholytes is conjectured to have a structure similar 
to a necklace, made of weakly charged parts of the chain, compacting into globules, connected by 
highly charged stretched 'strings'. We suggest a specific structure, within the necklace model, where 
all the neutral parts of the chain compact into globules: The longest neutral segment compacts into 
a globule; in the remaining part of the chain, the longest neutral segment (the 2nd longest neutral 
segment) compacts into a globule, then the 3rd, and so on. We investigate the size distributions of 
the longest neutral segments in random charge sequences, using analytical and Monte Carlo meth- 
ods. We show that the length of the nth longest neutral segment in a sequence of N monomers is 
proportional to N/n 2 , while the mean number of neutral segments increases as y~N. The polyam- 
pholyte in the ground state within our model is found to have an average linear size proportional to 
\/N, and an average surface area proportional to N 2 ^ 3 . 
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I. INTRODUCTION 

The desire to understand long chain biological macro- 
molecules, and especially proteins, stimulates extensive 
studies of polymers An important class of poly- 

mers are those with an electric charge along their back- 
bone. This work deals with heteropolymers, which carry 
positive and negative charges, known as polyampholytes 
(PA's) j| . Models of PA's are important to the study of 
proteins, since under normal physiological conditions, 5 
of the 20 naturally occurring amino acids have an excess 
charge [Q. We consider a polymer of charged monomers, 
interacting via unscreened Coulomb interactions, and we 
investigate its ground state structure. Throughout this 
work, we discuss PA's that consist of a random mixture of 
positive and negative charges, which cannot move along 
the chain. 

We are interested in the geometrical features of the 
ground state of a PA, and in particular in the depen- 
dence of its radius of gyration (r.m.s. size) R g on the 
number of monomers N: N v . Higgs and Joanny g] 

and later Wittmer et al. j6| , elaborated on arguments of 
Edwards et al. Q and suggested, on the basis of a Debye- 
Hiickel type theory ||, a collapsed structure (y = 1/3) 
for neutral PA's in the ground state: The chain takes ad- 
vantage of the presence of two types of charges along its 
backbone, and assumes a spatial conformation in which 
every charge is predominantly surrounded by charges of 
an opposite sign. 

A different approach to the study of the ground state 
of PA's |^] , is by scaling arguments, requiring that the in- 
teraction energy be the same on all length scales. These 
arguments lead to a stretched structure of the ground 
state, where R g ~ N. In this approach R g is averaged 
over a complete ensemble of all quenches, and the typical 
overall (excess) charge Q of the PA is ~ yN. Whereas in 
the Debye-Hiickel approximation, all the chains are neu- 
tral, since the overall neutrality of the PA is an essential 
condition for the validity of the Debye-Hiickel approx- 



imation. The extreme sensitivity of the ground state 
structure to the excess charge, noted by Kantor et al. 
]lO| , pl| , and supported by Monte Carlo (MC) simulations 
as well as variational mean field calculations jl2) , resolves 
the apparent contradiction between the scaling and the 
Debye-Hiickel motivated arguments. 

To gain some insight into the behavior of PA's, analo- 
gies to charged drops were explored ]l3|^15[|: A spher- 
ical drop charged beyond a certain charge, called the 
Rayleigh charge Qr, which depends on the surface ten- 
sion and volume of the drop, becomes locally unstable 
to elongation, since the pressure difference between the 
inside and outside of the drop vanishes. Even before the 
total charge reaches Qr, the drop becomes unstable to 
splitting into two equal drops. Additional splittings of 
the drop occur for larger charges. Similar behavior is 
expected in PA's charged to Qr ~ yN. Although a 
PA cannot split, the analogy to charged drops can still 
be exploited: Constraining the structure to maintain its 
connectivity by attaching droplets with narrow tubes, re- 
sults in a necklace-type structure of droplets connected 
by strings. 

For homogeneously charged polymers (polyclcc- 
trolytes), the charged drop analogy was used |l6| to 
characterize the structure completely within the neck- 
lace model (including the number of 'beads' and strings, 
their sizes, and the number of monomers in them, for a 
given temperature and charge density). However, trying 
to apply the necklace model to quenched PA's having 
random charges [^3|Jl4[ , several difficulties occur due to 
the randomness. It was noted, for instance, that a situa- 
tion occurs, in which most spherical shapes are unstable, 
while there is on average no energetic gain in splitting 
the sphere into two equal parts. A consistent theoretical 
picture for random PA's beyond the instability thresh- 
old was not found, but a typical PA is conjectured to be 
composed of rather compact globules connected by long 
strings. In order to reduce the electrostatical energy, the 
globules consist of segments of the chain that are approxi- 
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mately neutral (collapsing according to the Debye-Hiickel 
picture), while the strings are formed by highly charged 
segments. 

Since the structure of a randomly charged PA cannot 
be characterized completely analytically, we turn to nu- 
merical MC methods in order to characterize the ground 
state of such PA's within the necklace model. A key 
role in the structure of randomly charged PA's is played 
by the neutral segments in the chain (forming the beads 
in the necklace). We therefore apply MC methods to 
study the neutral segments size distribution of randomly 
charged PA's. The rest of the paper is organized as fol- 
lows: In section H we describe the process of construction 
of a ground state for a randomly charged PA, by dividing 
it into neutral segments. We discuss the motivation for 
this process, define the important parameters of the prob- 
lem and compare the process to similar existing models. 
In section |l| we investigate the sizes of the globules in 
the suggested ground state, including their dependence 
on the total number of monomers and on other param- 
eters. In section IV we discuss effects of finite chains, 



investigate the dependence of the total number of neu- 
tral segments in such finite chains on N, and construct 
a self consistent picture of the structure of the ground 
state. In section [V] we obtain some of the physical char- 
acteristics of the constructed ground state, such as its 
linear size and surface area, and in the last section we 
discuss our results and compare them to other studies. 



II. THE MODEL - MOTIVATION AND 
DEFINITIONS 

Empirical observations of the energy of randomly 
charged PA's suggested that the quench-averaged energy 
can be presented as a sum of condensation, surface and 
electrostatic energies 13 1^]: 



E =-^N + 1 S+ ( ?- 
a R 



(1) 



where q^/a is the condensation energy gain per parti- 
cle (qo is the typical charge of a monomer and a is a 
microscopic distance such as diameter of the monomer), 
7 ~ Qo/ - 3 i s the surface tension, and R is the linear size 
of the chain. (In Eq. (Jjj) we omitted dimensionless pref- 
actors of order unity.) When the excess charge Q is very 
small, the PA will form a single globule. When Q in- 
creases, the electrostatic forces will overcome the surface 
tension and the globule will split (similarly to the split- 
ting of a drop charged beyond the Rayleigh charge). Nu- 
merical studies suggest [p~3| p~5|] that a PA forms a necklace 
of weakly charged globules, connected by highly charged 
strings. This structure is a compromise between the ten- 
dency to reduce the surface area (i.e. to form globules) 
due to surface tension, and the tendency to expand, in 
order to reduce the Coulomb interaction caused by the 
excess charge. 



Kantor and Erta§ Jl 7| — 11 9|] attempted to quantify the 
qualitative necklace model, by postulating that the 
ground state of a PA will consist of a single globule, 
formed by the longest neutral segment of the PA, while 
the remaining part will form a tail. They investigated the 
probability that the longest neutral segment in a chain 
of N monomers has length L. A probability density was 
defined and investigated in the limit where N,L — > oo, 
while the reduced length I = L/N is fixed. The proba- 
bility density was obtained numerically and investigated 
analytically, but a complete analytical solution to the 
problem was not found. We follow p7| , and investigate 
the problem of the size distribution of neutral segments 
in randomly charged PA's by mapping the charge se- 
quence into a one-dimensional random walk (1-d PW). 
The charge sequence to — {qi} (i — 1,...,N; % = ±1) 
is mapped into a sequence of positions Si(u>) = J2]=i 1j 
(Sq = 0) of a random walker. (From now we will measure 
charges in units of the basic charge qo, and therefore qj 
will be dimensionless.) The random sequence of charges 
is thus equivalent to a RW, a chain segment with an ex- 
cess charge Q corresponds to a RW segment with total 
displacement of Q steps, and a neutral segment is equiv- 
alent to a loop inside the RW. Throughout the paper we 
will use the terminologies of randomly charged PA's and 
of RWs interchangeably. 
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FIG. 1. An example of a charge sequence lu, mapped into a 1-d 
RW Si(u>), and a typical loops structure. Filled circles indicate the 
starting and ending points of loops. The longest loop in the RW 
has 8 steps (Li = 8), the 2nd longest loop has 6 steps (L2 = 6), 
L3 = 4 and L4 = 2. The excess charge (which is equivalent to the 
total displacement of the RW) is Q = +8 and the total length is 
N = 28. 

When the longest neutral segment forms a globule (as 
assumed in Jl7|-^9[), the remaining part of the chain is 
very large (~ N)7 It is natural to assume that neutral 
segments on that tail will further reduce the total en- 
ergy by folding into globules. Eventually, the necklace 
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will consist of many neutral globules. However, there are 
many ways in which the chain can be divided into neu- 
tral segments, and we are interested in a simple unique 
structure. We therefore suggest a specific necklace-type 
structure, and construct the ground state for a randomly 
charged PA in the following way, depicted in Fig.[j]: The 
longest neutral segment contains L\ monomers; it com- 

1/3 

pacts into a globule of linear size proportional to ; in 
the remaining part of the chain the longest neutral seg- 
ment (the 2nd longest neutral segment) of size L 2 also 
compacts into a globule, then the 3rd and so on, until the 
segments become very small (of only a few monomers). 
Generally, L n denotes the number of monomers in the 
nth longest neutral segment, which compacts into a glob- 

1/3 

ule of radius L„ . Eventually, all the neutral segments 
are exhausted and we are left only with strings which 
carry the PA's excess charge Q, and connect the glob- 
ules. The total number of monomers in the chain is the 
number of monomers in all the neutral segments, plus 
the excess charge. 

The 'ground state', generated by this process of longest 
neutral segments compacting into globules, does not 
necessarily minimize the total energy of the PA. First 
of all, the process does not consider the possibility of 
weakly charged globules, which can include much more 
monomers than the neutral globules, thus compensating 
by surface energy for the additional electrostatic energy. 
Secondly, even when considering only neutral globules, 
it is not necessary that the procedure of compacting the 
longest neutral segment at each step generates the low- 
est energy state: It is possible that two 'medium sized' 
globules will have lower surface energy than a long one 
and a very short one (which remains after the first long 
segment was already chosen). 

A similar model, describing a breaking of a RW into 
loops, is the Loop Erased Self- Avoiding Walk (LESAW) 
model @. A LESAW is constructed from a RW, 
in which any loop generated by a self intersection is 
erased. The LESAW model was subject for intensive 
study ]2l|-p5[| , and many properties of the lengths of the 
erased loops in it were found. However, these results do 
not consider issues of the longest erased loops, and are 
therefore of little help to us. 

Other related models are those of randomly broken 
objects p6| , p7[ |, in which an object, such as a segment 
of unit length, is divided into mutually exclusive parts 
by a self-similar random process. The division of a seg- 
ment into parts resembles the 'breaking' of a RW into 
loops: The probability densities of the longest segments 
in some models of randomly broken objects resemble the 
probability density of the longest loop in fll8|| . However, 
one main difference between models of randomly broken 
objects and the longest loop problem, is that the prob- 
abilities of the longest loops are not self-similar. (The 
probability of a certain fraction of the chain to form the 
longest loop and the probability of the same fraction of 
the remaining chain to form a second longest loop are dif- 



ferent.) We will return to this absence of self-similarity 
in section 



III. 



III. 



SIZE DISTRIBUTION OF NEUTRAL 
SEGMENTS 



In order to characterize a PA in the necklace-type low 
energy state, made of neutral globules and highly charged 
strings, we are interested not just in the size distribution 
of the longest neutral segment, but also in the size dis- 
tribution of the 2nd longest segment, 3rd longest, and 
so on. We examined the statistics of loops in a 1-d 
RW of A steps, using MC method for several N's, up 
to N — 10 4 . For each A we randomly selected 10 6 se- 
quences. For each sequence we found the lengths L n of 
the (non-overlapping) loops, and calculated their distri- 
butions and averages. 

We denote by PN,n{L n ) the probability of the nth 
longest loop in a RW of N steps to be of length L n . The 
average length of the longest loop was found to be 

proportional to A. Expecting the same behavior for the 
length of the nth longest loop, we define the probability 
density of the nth longest loop: 



Pn{k 



N 



[P N , n (L n ) + P Nt „(L n + l)} , 



(2) 



where l n = L n /N. (Note that at least one term in 
Eq. (||) vanishes, since loops can be only of even length. 
Therefore, definition (Q) includes average of probabilities 
for L n and L n + 1 as in the definitions used in contin- 
uum limits for discrete RWs, in order to prevent even- 
odd oscillations.) For small N's this definition of p n (l n ) 
still depends on N, but in the A, L n — > oo limit it be- 
comes a function of only Numerical evidence for the 
A— independence of pi(h) for A — -> oo was presented in 
|l8| . MC results for p n (l n ) for several values of n indi- 
cate that the probability densities are virtually indepen- 
dent of N |28[] . For small A and large n the probability 
density p n (l n ) depends on A, since L„'s are short, and 
continuum limit is expected only when L n 3> 1. In or- 
der to overcome effects of finite N, we determined the 
A— dependence of (L n ), through an extrapolation of the 
slopes of the linear fits of log(L„) vs. log A to A — > oo. 
For any given n we calculated, through a local linear fit, 
slopes for several values of N, and estimated them at 
N — > oo limit. Furthermore, it can be proven analyti- 
cally p]| that the probability density of the longest loop 
is 'universal', i.e. for long RWs, it does not depend on 
the number of steps, or on details of a single step. 

Since p n (l n ) are N— independent, we can investigate 
them for any given (large enough) A. The probability 
densities of the five longest loops in a chain are depicted 
in Fig.|| for N = 1000. Several properties of p n {l n ) are 
evident from this figure. The probability density pi(l\) 
was shown |18 to have a square root divergence at l\ = 1, 
and a discontinuous derivative at l\ = i. When the 
length of the first loop L\ in a specific chain is long, 
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then the lengths of the other loops L n (for n > 1) in 
that chain must be short, since the total length of all the 
loops cannot exceed N. Therefore, since in many chains 
the length of the longest loop is almost equal to N (as 
indicated by the divergence of pi{h) when l\ — ► 1), the 
lengths of the other loops approach zero. This is evident 
from the divergence of p n (ln) when l n — > for all n > 1. 
Because in any specific chain the length of the nth longest 
loop is shorter than the length of the fcth longest loop, 
for n > k, the divergence of p n (l n ) near zero is stronger 
for large n. Since the probability densities p n (l n ) are 
normalized separately for each n, then any two of them 
must intersect (i.e. p„(/„) always intersects p n '(l n >) for 
n 7^ n'). The length of the second longest loop never 
exceeds the length of the first longest loop, and the sum 
of their lengths never exceeds N. Therefore, the length 
of the second longest loop cannot exceed half the length 
of the chain. Consequently, I2 < \ for all the chains, and 
Pii^-i) vanishes identically for I2 > \- Similarly we can 
show that l n < 1/n for all n and p n (l n ) = for l n > 1/n. 




FIG. 2. Probability densities of the 5 longest loops from 
right to left: pi(Zi)-thick points, p2('2)-solid line, p3(i3)-dashed 
line, p4(/4)-dot-dashed line, p5(is)-dotted line, as a function of 
l n = L n /N from MC results of 10 6 random sequences of length 
N = 1000. 

We note that P2C2) is qualitatively similar to the prob- 
ability densities of the length of the second segment in 
different models of randomly broken objects |26|]27f| (see 
Figs. 1(b) and 3(b) in p6[ ). However, the probability 
densities of the length of the second segment in |g6[ have 
strong singularities (of the first derivative) at I2 = 1/4, 
and are shown to have singularities at I2 — 1/k for all 
integer k > 2. All the probability densities p n {l n ) of the 
longest loops have a singularity when they become iden- 
tically zero (p n (l n ) vanishes for l n > 1/n and therefore 
is non-analytical at l n = 1/n). It is possible that due to 
the singularity in p n i (l n i ) at 1 /n' all the other probabil- 
ity densities p n {l n ) (for n < n') also have singularities 



at since all the probability densities are dependent. 
Apparently, these singularities do not cause discontinuity 
of the first derivative, and, therefore are not visible in the 
numeric data. In several models of randomly broken ob- 
jects p^ , p7| , the probability densities of the length of the 
nth segment have singularities at / = 1/n' (for n < n'). 
These singularities are due to a self-similar process, which 
leads to a different analytical expression for the proba- 
bilities on each interval l/(n' + 1) < I < 1/n'. Since the 
random process of generating the longest loops in our 
process is not self-similar (as indicated in the previous 
section), the reason for the singularities in the models 
of randomly broken objects does not hold in the case of 
the nth longest loop (as was suggested by Frachebourg 
et al. p9| ). Therefore, the probability of the nth longest 
loop does not necessarily have singularities at values of 
l = l/n'. 

Since in any specific RW, L\ > L2 > • • • > L n , the av- 
erage length of the nth longest loop (L n ) decreases (for 
fixed N) with increasing loop number n. There is no 
typical scale in the problem, and we may expect a power 
law dependence (L n ) ~ Nn~ a , with a > 0. The total 
number of steps in all the loops in any given RW cannot 
exceed N (i.e. J2 n L n < N), and therefore J2 n (L n ) < N. 
The convergence of the sum means that a > 1. We de- 
pict in Fig.0 the average reduced length (/„) vs. n on 
a logarithmic scale. To avoid systematic errors due to 
finite N, each value of (/„) in the graph was determined 
through an extrapolation: For each n, we plotted (l n ) 
vs. 1/N, and found the extrapolated value of (L) near 
1/N = 0. These values of (l n ) are depicted in Fig.||. The 
linear fit to the data points has a slope of —2.3 ± 0.4. We 
therefore conclude that as N — > 00: 



N 
n a 



where a = 2.3 ± 0.4 



(3) 



At the next section we will argue that a 
within the error limits of Eq. (ph. 



2, which is 




FIG. 3. Logarithm (base 10) of average reduced length of the 
nth longest loop as a function of logarithm of loop number n. 
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IV. NEUTRAL SEGMENTS IN FINITE CHAINS 

So far we have discussed only long chains and their 
properties in the A — > oo limit. There are several dif- 
ferences between the properties of infinitely long chains 
and finite size chains. First of all, in finite size chains the 
monomers that do not belong to any neutral segment 
constitute a finite part of the chain, as opposed to a van- 
ishing part as A — > oo. Secondly, in infinitely long chains 
the number of neutral segments is infinite, while for finite 
A at some point there are no more neutral segments. 

Let us consider the total number of monomers which 
do not belong to any neutral segment, for finite N. It 
is easily shown that the number of monomers left out 
from all the neutral segments in any specific random se- 
quence is exactly the excess charge Q of the correspond- 
ing PA: The excess charge cannot exceed the number of 
monomers left out, because, by definition, the neutral 
segments do not have excess charge. On the other hand, 
all the monomers left out from the neutral segments must 
be part of the excess charge: If, for instance, a PA has ex- 
cess positive charge, then no negative charge can be left 
out from all the neutral segments. (If there had been a 
group of negative charges, it would have joined a group of 
positive charges to make a neutral segment, or together 
with a group of positive charges, would have joined an 
existing neutral segment and make it longer). Since the 
r.m.s. excess charge of a randomly charged sequence of 
N charges of ±1 is equal to s/N, the r.m.s. number of 
monomers not in any neutral segment in the chain is also 
equal to vN, becoming a vanishing fraction of the chain 
as A — > oo. 

At some point the process of search for the next longest 
loop exhausts all the loops in the RW. We investigate 
this stage in the process, by analyzing the number n/ of 
loops in a RW. When A is infinite, the average length of 
the nth longest loop is given by (L n ) ~ Nn~ a . Applica- 
tion of this equality for finite N's would predict, for large 
enough n's, (L n ) < 1. Since this is not possible (the mini- 
mal length of a loop is two steps), we argue that Eq. (|3|) is 
valid, for finite N's, only to describe the average lengths 
of the longest rif loops. There is no typical scale to the 
problem of the total number of loops, and we therefore 
expect a power law dependence of (n/) on N. The length 
of the last loop, for all chain lengths, is usually very short 
(consisting of only few positive and negative charges), 
having length independent of N, i.e. (L nf ) ~ A , which 
means that (lnf) ~ A -1 . Since (l nf ) ~ nj a (substitut- 
ing n = rif in Eq. ||), we can expect: 

(rif) — N y , where y = — . (4) 

a 

Substituting a from Eq. (|J), leads to a value of y — 
0.43 ± 0.09. Investigating numerically the dependence of 
(rif) on N, we found that y = 0.46±0.06, which is within 
the error limits of the value predicted by Eqs. (||) and (||). 



At the end of this section we will argue that a = 2 and 
y = 0.5, values which are within the error limits of those 
deduced from the numeric data. In order to confirm the 
(rif) ~ \/~N relation, we show in Fig.|| the probability 
density of rif divided by \/N~ for several chain lengths 
(JV = 10 2 -i- 10 4 ). The division by VA causes a reason- 
able collapse of the graphs for different values of A to a 
single function, which is (almost) A— independent. We 
see that the probability density has a maximum when 
rif I \/N is close to zero (the most probable value of n/ 
is finite and independent of A), and it decreases to zero 
with increasing rif/yN. 
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FIG. 4. Probability density of rif, the total number of loops in 
a chain, divided by \/N, for N = 100, 300, 500, 1000, 3000, 5000, 
10000 (from right to left). 

Knowing the statistical properties of the chain, we try 
to construct a self-consistent complete picture, in which 
our numerical results fit together. We know that the 
total length of all the loops (Xm=i L n ) equals to the en- 
tire length of the chain minus the steps not in any loop 
(which are the excess charge, which are on the average 
y/~N). Dividing this equality by A, taking an average 
over the random sequences, we get: 

n f 

1-^ = <X>>- (5) 

On the other hand, from Eqs. (||) and ([|) we know that, 
omitting constants of order unity: 

/(»/} \ <"/> -N" 

( E ) = E < z «> - / n ~ adn - 1 - NV{1 ~ a) ■ 

\n=l I n=l Jn=1 

(6) 

Comparison of the powers of A in Eqs. (||) and (|6|) leads 
to 

v= %hx)' (7) 
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which together with Eq. (||) is satisfied by: 

" = 2 , y = \. (8) 

These equalities are satisfied by the values of y = 0.46 ± 
0.06, and a = 2.3 ±0.4 obtained numerically, and consti- 
tute a self-consistent picture, in which the average con- 
formational properties of the constructed ground state fit 
together. 

V. PHYSICAL PROPERTIES OF THE GROUND 
STATE 

In this section we investigate some of the physical char- 
acteristics of the constructed ground state of randomly 
charged PA's. We focus on the linear size R and on the 
surface area S of the proposed ground state, trying to ex- 
plain their dependencies on A through the self consistent 
picture constructed in the previous section. We define i?, 
the linear size of the chain, according to the picture of 
Fig. [0: The neutral segments in the chain compact into 

1/3 

globules (each with a linear size of -Rsegment ~ -^segment)- 
If all the globules are linearly packed then the total lin- 
ear size is the sum of the linear sizes of all the globules 
(R = -Rsogmont)- The linear size of the chain must 
include the monomers not in any neutral segment (the 
'strings' in the necklace, which are the total excess charge 
Q). We therefore get a means to describe the chain's size: 

"/ 

R^J2 L n 3+ Q- ( 9 ) 

71=1 

(Here, and throughout this section, we omit prefactors 
of order unity.) It is evident that the generated state 
captures some essential features of the ground state sug- 
gested by the necklace model: The necklace type struc- 
ture is compact (i.e. R ~ A 1 / 3 ) when the PA is neu- 
tral (the longest neutral segment is the entire chain) or 
has very small excess charge, and begins to stretch as 
the excess charge increases (the charged strings become 
longer). Finally, the PA becomes completely stretched 
(i.e. R ~ A) for the fully charged polymer. 

Investigating the average chain's linear size depen- 
dence on N, we get the dependence: 

(R) ~ N u , where v = 0.50 ± 0.01 . (10) 

Fig. H depicts the probability density of R, divided by 
y/N for several values of N. From the data collapse we 
deduce that the iV— dependence in Eq. ([h]) is valid not 
just for the average linear size, but represents a scaling 
of the entire probability density. The v ~ ^ power in 
Eq. (|l(]) means that the chain is not compact (although 
the distribution is 'peaked' near the lowest possible value 
of R) , and is not completely stretched, but has a linear 
size as an ideal RW with A steps. 
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FIG. 5. Probability density of R, the linear size of the en- 
tire chain, for several chain lengths: N =100 (dotted line), 300 
(dot-dashed line), 1000 (dashed line), 3000 (solid line), 10000 (thick 
points). The minimal possible value of R (Ro = N 1 / 3 ) is sub- 
tracted from R, and the result is divided by vJV to collapse the 
data. 

We can explain the dependence in Eq. ([!(]), by as- 
suming that X^i^™ 1 ^ 3 ) an d J2n (Ln) 1 ^ 3 have the same 
N— dependence, and by using the power laws of (rif) and 
(L n ) (Eqs. | and @), with a = 2 and y = \ (Eq. |): 

«/ <«/> „JV« , M x 1/3 

£<^ /3 >~£c^> 1/3 ~/ (£) ^°- 5 . 

n=l n=l Jn=1 V J 

(11) 

In order to confirm this dependence, we investigated 
(Xm=i Li/ 3 } as a function of A, and found the power 
law dependence with exponent 0.49 ±0.02, in accordance 
with the prediction of Eq. (pT|). We see that the average 
linear chain size (R) in Eq7(p|) is a sum of two terms, 
each proportional to y/N. 

Similarly to the definition of R, we can define the sur- 
face area S of the chain. Each neutral segment compacts 

2/3 

into a globule of surface area ~ £ S c gmcn t, au the globules 
are linearly packed, and the surface area of the 'strings' 
is proportional to the number of monomers not in any 
neutral segment, which is the excess charge. We can 
therefore define: 

S^J2 L n /3 +Q- ( 12 ) 

71=1 

S can have values ranging from A 2 / 3 , for a neutral chain, 
to N, for a completely charged chain. These limits in- 
deed correspond to the expected behavior of PA. Obvi- 
ously S > A 2 / 3 . We analyzed the A-dependence of S, 
and found that: 

(S) ~ A a67±a01 . (13) 
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When we subtract from S its minimal value, and divide 
the result by iV 2 / 3 , we get a distribution which is iden- 
tical for all N. This dependence means that the average 
surface energy of the generated structure has the same 
TV— dependence as the surface energy of a single compact 
globule (or several compact globules, each containing a 
finite part of the chain). From the same arguments that 
led to Eq. @, we get: 

<«/> -AT* / /v\ 2/3 

n=l Jn=1 ^ U ' 

(14) 

This power of N is in accordance with the numerically 
obtained value of 0.67 ± 0.01, and indicates that the N- 
dependence of the average surface area is determined by 
the neutral segments (i.e. the 'beads' in the necklace), 
and is not affected by the excess charge (the 'strings' in 
the necklace). 

o 2 

We investigated the expression S + ^ , which has the 
same A-dependence as the energy of the generated struc- 
ture. (We considered the energy terms of Eq. (|l|), and 
omitted the condensation term, since it is the same for all 
structures of a given N). Exploring the N— dependence 

of S + ^j- (denoted as E), we found that: 

(£!) = (5 + 9L) „ ^o.eeio.oi (15) 
R 

When subtracting from E its minimal value and dividing 
the result by A 2 / 3 , we get a probability density which 
is identical for all N. This dependence means that the 
total energy is very low: The surface energy term (S) 
gets its minimal value (~ iV 2//3 ), and R ~ A 1 / 2 , thus 
bringing the electrostatic energy term to a point where 
it does not affect the A-dependence of the total energy. 
The total energy behaves very much like the surface en- 
ergy, as if the chain is constructed of a single compact 
neutral globule. 



of 'beads' and 'strings' in the necklace, the spatial ex- 
tent, the surface area and energy). This structure is 
compact when the chain is neutral or weakly charged, 
and stretches as the chain becomes charged. We have 
found that the ground state structure has a very low en- 
ergy, which depends on the number of monomers as the 
energy of a single compact neutral globule. We have also 
shown that the unrestricted average of the linear size of 
the polymer in the ground state depends on the number 
of monomers as the linear size of an ideal chain, with 
a critical exponent of v = 0.50 ± 0.01. Although the 
qualitative behavior of our distribution for the linear size 
is similar to this obtained in Jl5| (it is peaked near its 
smallest possible value, and has a tail, which determines 
the asymptotic behavior of u), Kantor and Kardar fHf ] 
concluded that the average linear size increases with N 
at least as fast as a self-avoiding walk (i.e. v > 0.6). We 
believe that it is worthwhile to slightly alter the way in 
which compact globules are formed within our model (by 
allowing for weakly charged globules or by not forcing all 
the neutral segments to completely compact), in order to 
try to reproduce this 'swelling' of the average chain. 

The investigation of the size distribution of neutral seg- 
ments through the analogy to 1-d RWs, was limited to 
a particular class of RWs, in which a unit displacement 
appears at each step (i.e. each monomer in the chain 
is charged ±1). In a future publication p8|j , we define 
the problem of longest loops in the limit where the RW 
becomes a true Gaussian walk, and prove through a scal- 
ing process the universality of the probability densities 
of longest loops. This universality means that the prob- 
ability densities of longest loops are independent of the 
number of steps and of the nature of the single step of 
the RW. Therefore, the results obtained for a specific 
case of randomly charged PA's are valid for all randomly 
charged polymers, in which (for long chains) the charge 
distribution is an unbiased Gaussian. 
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